Stochastic Optimal Control for Small Noise Intensities: The Discrete-Time Case

Author

  • HUGO CRUZ-SUÁREZ
Abstract

This paper deals with Markov decision processes (MDPs) on Borel spaces with an infinite horizon and a discounted total cost. We consider a stochastic optimal control problem that arises by perturbing the transition law of a deterministic control problem through an additive random noise term with coefficient epsilon. We analyze the behavior of the optimal solution (optimal value function and optimal policy) of the stochastic system as epsilon goes to zero. Specifically, the conditions given in the paper guarantee that, as epsilon goes to zero, the optimal value function and the optimal policy of the stochastic system converge, uniformly on compact sets, to the optimal value function and the optimal policy of the deterministic system, respectively. Finally, two examples that illustrate the theory are presented.

Key words: Stochastic Optimization, Markov Decision Process, Dynamic Programming, Total Discounted Cost, Deterministic Approximation, Inventory/Production System
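
The small-noise limit described in the abstract can be illustrated with a minimal sketch. Everything specific in it is an assumption chosen for illustration and is not taken from the paper: a scalar system x_{t+1} = x_t + a_t + epsilon * xi_t with zero-mean, unit-variance noise xi_t, stage cost x^2 + a^2, and discount factor alpha = 0.9. For this linear-quadratic model the optimal value function has the closed form V_eps(x) = P x^2 + alpha P eps^2 / (1 - alpha), where P solves P = 1 + alpha P / (1 + alpha P), so the gap to the deterministic value function (eps = 0) can be computed exactly.

```python
"""Small-noise limit in a scalar discounted LQ problem.

Illustrative assumptions (not from the paper): dynamics
x' = x + a + eps * xi with E[xi] = 0, E[xi^2] = 1, stage cost
x^2 + a^2, and discount factor alpha in (0, 1).
"""


def riccati_coefficient(alpha, tol=1e-12, max_iter=10_000):
    """Fixed point of P = 1 + alpha*P / (1 + alpha*P), found by iteration."""
    p = 1.0
    for _ in range(max_iter):
        p_next = 1.0 + alpha * p / (1.0 + alpha * p)
        if abs(p_next - p) < tol:
            return p_next
        p = p_next
    return p


def optimal_value(x, eps, alpha=0.9):
    """V_eps(x) = P*x^2 + alpha*P*eps^2 / (1 - alpha) for the assumed model."""
    p = riccati_coefficient(alpha)
    noise_cost = alpha * p * eps**2 / (1.0 - alpha)
    return p * x**2 + noise_cost


def optimal_action(x, alpha=0.9):
    """Minimizer of the Bellman equation; it does not depend on eps here."""
    p = riccati_coefficient(alpha)
    return -alpha * p * x / (1.0 + alpha * p)


def sup_gap_on_compact(eps, radius=5.0, alpha=0.9, points=101):
    """Largest |V_eps(x) - V_0(x)| over a grid on [-radius, radius]."""
    grid = [-radius + 2.0 * radius * i / (points - 1) for i in range(points)]
    return max(
        abs(optimal_value(x, eps, alpha) - optimal_value(x, 0.0, alpha))
        for x in grid
    )


if __name__ == "__main__":
    for eps in (1.0, 0.1, 0.01, 0.0):
        gap = sup_gap_on_compact(eps)
        print(f"eps={eps:<5} sup gap on [-5, 5] = {gap:.6f}")
```

In this assumed model the gap equals alpha P eps^2 / (1 - alpha) for every x, so it shrinks quadratically in epsilon, and the optimal policy is the same for all epsilon (certainty equivalence). The paper's setting is more general, and the convergence of the policy there is not trivial in this way.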

Similar resources

Risk-Sensitive Control and Dynamic Games for Partially Observed Discrete-Time Nonlinear Systems

In this paper we solve a finite-horizon partially observed risk-sensitive stochastic optimal control problem for discrete-time nonlinear systems and obtain small noise and small risk limits. The small noise limit is interpreted as a deterministic partially observed dynamic game, and new insights into the optimal solution of such game problems are obtained. Both the risk-sensitive stochastic con...

Insurer Optimal Asset Allocation in a Small and Closed Economy: The Case of Iran’s Social Security Organization

We seek to determine the optimal amount of the insurer’s investment in all types of assets for a small and closed economy. The goal is to detect the implications and contributions that risk-seeking and risk-averse insurers commonly make and their effectiveness in the investment decision. Also, finding the optimum portfolio for each is the main goal of the present study. To this end, we adopted the...

Systematic Perturbations of Discrete-Time Stochastic Dynamical Systems

The discrete-time stochastic optimal control problem is approximated by a variation of differential dynamic programming with systematic calculations of the perturbations due to small stochastic noise. This problem is related to the dual control aspects of stochastic optimal control problems. The motivation is to correct prior calculations for missing terms and to examine the foundations of the ...

Optimal discrete-time control of robot manipulators in repetitive tasks

Optimal discrete-time control of linear systems has already been presented. Designing an optimal discrete-time control for a robot manipulator is difficult because the manipulator is highly nonlinear and uncertain. This paper presents a novel robust optimal discrete-time control of electrically driven robot manipulators for performing repetitive tasks. The robot performs repetit...

Discrete-time repetitive optimal control: Robotic manipulators

This paper proposes a discrete-time repetitive optimal control of electrically driven robotic manipulators using an uncertainty estimator. The proposed control method can be used for performing repetitive motion, which covers many industrial applications of robotic manipulators. This kind of control law is in the class of torque-based control in which the joint torques are generated by permanen...

Publication date: 2010